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Data Visualization in R 
1. Overview 


Michael Friendly 
SCS Short Course YORK 


Sep/Oct, 2018 E Ü 


http://datavis.ca/courses/RGraphics/ UNIVERSITÉ 


UNIVERSITY 


Course outline 


1. Overview of R graphics 
2. Standard graphics in R 


— qu " 


4. ggplot2 


Outline: Session 1 


* Session 1: Overview of R graphics, the big picture 
=" Getting started: R, R Studio, R package tools 
= Roles of graphics in data analysis 
e Exploration, analysis, presentation 
= What can | do with R graphics? 


e Anything you can think of! 


e Standard data graphs, maps, dynamic, interactive graphics — 
we'll see a sampler of these 


* R packages: many application-specific graphs 
=" Reproducible analysis and reporting 

e knitr, R markdown 

e R Studio 


Outline: Session 2 


* Session 2: Standard graphics in R 


" Robject-oriented design 


input : 
data.frame model | 
object 


=" Tweaking graphs: control graphic parameters 
* Colors, point symbols, line styles 


glm() 
polr() 
multinom() 


plot(mod) 
plot(f(mod)) 


* Labels and titles 


=" Annotating graphs 
* Add fitted lines, confidence envelopes 


Outline: Session 3 


* Session 3: Grid & lattice graphics 
= Another, more powerful "graphics engine" 
€ All standard plots, with more pleasing defaults 


= Easily compose collections (“small multiples") 
from subsets of data 


€= vcd and vcdExtra packages: mosaic plots and 
others for categorical data 


Lecture notes for this session are available on the web page 


Outline: Session 4 


* Session 4: ggplot2 


=" Most powerful approach to statistical graphs, 
based on the "Grammar of Graphics" 

= A graphics language, composed of layers, "geoms" 
(points, lines, regions), each with graphical 
"aesthetics" (color, size, shape) 

= part of a workflow for "tidy" data manipulation 
and graphics 


Resources: Books 


R Graphics 


Second Edition 


| 
R Graphics 
Cookbook 


Paul Murrell, R Graphics, 2nd Ed. 


Covers everything: traditional (base) graphics, lattice, ggplot2, grid graphics, maps, network diagrams, ... 
R code for all figures: https://www.stat.auckland.ac.nz/~paul/RG2e 


Winston Chang, R Graphics Cookbook: Practical Recipes for Visualizing Data 
Cookbook format, covering common graphing tasks; the main focus is on ggplot2 


R code from book: http://www.cookbook-r.com/Graphs/ 
Download from: http://ase.tufts.edu/bugs/guide/assets/R%20Graphics%20Cookbook.pdf 


Deepayn Sarkar, Lattice: Multivariate Visualization with R 
R code for all figures: http://Imdvr.r-forge.r-project.org/ 


Hadley Wickham, ggplot2: Elegant graphics for data analysis, 2nd Ed. 
1st Ed: Online, http: lot2.org/book 

ggplot2 Quick Reference: http://sape.inf.usi.ch/quick-reference lot2 

Complete ggplot2 documentation: http://docs.ggplot2.org/current 


Resources: cheat sheets 


R Studio provides a variety of handy cheat sheets for aspects of data analysis & 


graphics See: https://www.rstudio.com/resources/cheatsheets/ 
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Download, laminate, 
paste them on your 
fridge 


Getting started: Tools 


* To profit best from this course, you need to install 
both R and R Studio on your computer 


The basic R system: R console (GUI) & packages 
Download: http://cran.us.r-project.org/ 


Add my recommended packages: 
source(“http://datavis.ca/courses/RGraphics/R/install-pkgs.R”) 


The R Studio IDE: analyze, write, publish 
Download: 


https://www.rstudio.com/products/rstudio/download 
Add: R Studio-related packages, as useful 


ggplot2 


R package tools 


ggplot2 


© 0o © 9 


The tidyverse 


Data prep: Tidy data makes analysis and graphing 
much easier. 
Packages: tidyverse, comprised of: tidyr, dplyr, lubridate, ... 


Components 


R graphics: general frameworks for making standard and custom graphics 
Graphics frameworks: base graphics, lattice, ggplot2, rgl (3D) 

Application packages: car (linear models), vcd (categorical data analysis), heplots 
(multivariate linear models) 


Publish: A variety of R packages make it easy to write and publish research reports 
and slide presentations in various formats (HTML, Word, LaTeX, ...), all within R 
Studio 


Web apps: R now has several powerful connections to preparing dynamic, web- 
based data display and analysis applications. 
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Getting started: R Studio 


È nstu 
Ele Edit View Workspace Plots Tools Help 


Console @ Workspace History ez 


£P Loa Ej Saver L Import Dataset» | yf Clear All 


command history 
workspace: your variables 


Files Plots Packages Help ec 
Qj New Folder | Q | Delete | 7 Rename {$ More- z 
@ Home 
a Name Size Modified ^ 
2) .Rhistory 1.6 KB Jun 10, 2011, 1:59 PM 
j 20070724 data.xls 13.5 KB Jan 7, 2008, 11:51 PM 
toHotkey.ahk 11.3 KB Feb 28, 2011, 12:04 PM 
le S 10.3 KB Apr 21, 2009, 10:00 AM 


@ code 
cuznsim s: | ots 29.5 KB Aug 5, 2008, 4:32 PM 
Default.rdp| 2KB Dec 20, 2010, 8:30 AM 


(just like Rterm) E packages 


@ Downloads 

@ eagle 

C ExpressPCB 

..] facs-log.xls e D 
] FlowJo75.prefs 

—] funding ops  deadearly2007.txt 918 bytes De 


] Geneious Backup 2011-04-27.zip 3.8 MB Apr 27, 2011, 1:36 PM 
..] Geneious Backup 2011-05-16.zip 6.5 MB May 16, 2011, 5:55 PM 


1 KB Oct 22, 2007, 11:29 AM 
6.4 KB Mar 9, 2011, 5:01 PM 
28, 2006, 11:47 AM 


C 


R Studio navigation 


R folder navigation commands: 


e Where am I? 
» getwd() 


[1] "C:/Dropbox/Documents/6135" 


e Go somewhere: 


> setwd("C:/Dropbox") 
> setwd(file.choose()) 


R Studio GUI 
* Take R to your preferred directory () 


fi te x P 
@ File Edit Code View Plots | Session Project Build Tools Help 


Vid. nter 
x | 
0 ] Untitled * ~ Restart R 
Source on Save 41 f° Terminate R.. HR 
1 r : z 
Set Working Directory 8 To Source File Location 
ET Load Workspace... To Files Pane Location 
| Save Workspace As... Choose Directory... — Cri *Shifr* K 
LI tà (Top Level 


| & Clear Workspace... Select and change ic anew working directory | 
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R Studio projects 


EEEECET-— Í , 
— R Studio projects are a handy way to 


K) Psychological Methods Paper + O rga n ize yo u r WO rk 


Environment History Build 9X. New Project... 
CT Ej. Import Dataset ~ 4 2 Open Project... Mew Project — 
dh Global Environment ~ Open Project in New Session... Create Project 
Close Project 
R New Directory 
Environment! Psychological Methods Paper T) Start a project in a brand new working directory | 
histdata zi | | 
im Existing Directory | 
matlib zi | Associate a project with an existing working directory | 
WordPools 7 
£ Version Control 
EqCov e Se Checkout a project from a version control repository 
Lahman T) 
heplots AR Cancel 
Files Plots Packages Help Viewer . . 
o candisc zi — 
New Fold Delet Renam cw role 
Qj Ne er Q | Delete [$] Rename 4 vodisira D 
C: > Dropbox > Psychological Methods Pa . Back Create New Project 
Name wa un - 
t. Clear Project List pusctory name: _ EM 
p FriendlySigal2017- TQMP.pdf Project Options R Cp Meese ae 
mi |C/R/projects Browse... 
È Psychological Methods PaperRproj “zro WOV IZ, ZUIO IZO AVI F m DURS 
(Create a git repository 
] TQMPpaperzip 946.2 KB Nov 7, 2016, 8:23 AM 
* jJ Makefile 3.3 KB Nov 7, 2016, 8:21 AM 
| Makefile.bak 34 KB Nov 6, 2016, 10:50 AM 
] FIGLIST 591B Nov 6, 2016, 10:43 AM » 
Open in new session Create Project Cancel 
] TQMP.dep 951 B Nov 6, 2016, 10:16 AM i [Resa eee 
] TQMPpaper.aux 14.7 KB Nov 6, 2016, 9:53 AM 
] TQMPpaper.log 80.3 KB Nov 6, 2016, 9:53 AM 


13 


R Studio projects 


An R Studio project for a research paper: R files (scripts), Rmd files (text, R "chunks") 


© G/Dropbox/Psychological Methods Paper - RStudio [--][-6- fea) 
File Edit Code View Plots Session Build Debug Profile Tools Help 
Ol-~ 2- T o to file/f n [s| *| Addins ~ ÈI Psychological Methods Paper ~ 
QjTQMPpaperRmd x  @jparenting.Rmd* — ? jRohwerMMRA-exR * — QjSocialCog.Rmd * —?jparenting-exR « — 9 > eal] Environment History Build X New Project... 
T me Q | af knit D- B o =#Run -| S |[=)| GP Ej | E import Dataset ~ | 4 E? Open Project... 
ESr-- ^ Introduction Global Environment ~ jecti à 
2 title: "Graphical Methods for Multivariate Linear Models in [M Problems in und... a CaP het ee 
Psychological Research: An R Tutorial” | Br Close Project 
: " " " rief review of ... 
3 shorttitle: "Graphical Methods for MLMs Tes 
ting contra... NN OREITIA : f 
4 author: Réposiod M Environment Psychological Methods Paper — 2; 
5 - name: Michael Friendly ae my iiis 4 
6 affiliation: 1 Example: Fallen... E 
7 corresponding: yes # Define only one corresponding author Graphical Method... matlib zi 
8 address: Psychology Department, York University, Toronto, Ontario, Data ellipsoids: ... p P 
Canada, M331P3 HE plots: Visuali... WordPools x 
9 email: friendly&yorku.ca Significance s... EqCov T) 
10 - name: Matthew Sigal Example: Fath... tk 
11 affiliation: 1 Visualizing co... — = 
12 affiliation: Low-D views: ... heplots 
13 - id: 1 Examples Files Plots Packages Help Viewer i J 
14 institution: York universit ivari i ENSE = 
45 y Multivariate multi... | 5) New Folder | | Delete L$] Rename | {| ! 
MANOVA designs vcdExtra a 
16 abstract: | MANCOVA and ... [7]- C: > Dropbox > Psychological Methods Paj 
I7 This paper is designed as a tutorial to highlight some Pergit ee ind a 
18 recent developments for visualizing the relationships among t. SE 
19 response and predictor variables in multivariate linear - Clear Project List 
1:1 £3 Graphical Methods for Multivariate Linear Models in Psychological Research: An R Tutorial $ R Markdown + T3 FriendlySigal2017-TQMP.pdf Project Options... 
È| Psychological Methods PaperRproj —z«rs-—— ——novrz;zuro-rrzoAWwr-T—7 
Console C:/Dropbox/Psychological Methods Paper/ eC Y 3 pops i ‘ 
CUpyrigrit (C7 ZULO ie K FOUNUALION Por Statistical Compucirig A ] TQMPpaper.zip 946.2 KB Nov 7, 2016, 8:23 AM 
Platform: x86_64-w64-mingw32/x64 (64-bit) El $3 Makefile 33KB Nov7, 2016, 8:21 AM 
R is free software and comes with ABSOLUTELY NO WARRANTY. - j Makefile.bak 34 KB Nov 6, 2016, 10:50 AM 
You are welcome to redistribute it under certain conditions. m = 
Type 'license()' or 'licence()' for distribution details. - L FIGLIST 591 B Nov 6, 2016, 10:43 AM 
] TQMP.dep 951 B Nov 6, 2016, 10:16 AM 


R is a collaborative project with many contributors. 


Type 'contributors()' for more information and EL TQMPpaperaux 147 KB Nov 6, 2016, 9:53 AM 
'citation()' on how to cite R or R packages in publications. E | TQMPpaper.log 80.3 KB Nov 6, 2016, 9:53 AM 
Type 'demo()' for some demos, 'help()' for on-line help, or | TQMPpaper.out 6 KB Nov 6, 2016, 9:53 AM 
“help. start()" for an HTML browser interface to help. T7 TQMPpaper.pdf 819.8 KB Nov 6, 2016, 9:53 AM 
Type 'qO' to quit R. 

| TQMPpaper.synctex.gz 286.1 KB Nov 6, 2016, 9:53 AM 


> k 9 Rhistory 135 KB Nov3,2016,938 AM ~ 


Organizing an R project 


* Use a separate folder for each project 
* Use sub-folders for various parts 


JON ü » Computer » LocalDisk(C:) » Dropbox » Documents 


File Edit View Tools Help 


Organize v Include in library v 


B Downloads 
E| Recent Places 


$$ Dropbox 


J VCDR 

Ji DDAR 

p VCD course 
data files: 
e raw data (.csv) 
* saved R data 


(.Rdata) 


s b 6135 » R » myproject > v +4 || Search myproject 


Share with v Burn New folder 


tes lysis.R read-mydata.R | 


figures: R files: 
e diagrams e data import 
* analysis plots * analysis mypeper md 


Write up files will 
go here (.Rmd, 
.docx, .pdf) 


Organizing an R project 


* Use separate R files for different steps: 
= Data import, data cleaning, ... > save as an RData file 


# read the data; better yet: use RStudio File -> Import Dataset ... 
mydata «- read.csv("data/mydata.csv") 


4 data cleaning .... 


# save the current state 
save("data/mydata.RData") 


Organizing an R project 


* Use separate R files for different steps: 


= Analysis: load RData, ... 


# analysis 
load("data/mydata.RData") 


# do the analysis — exploratory plots 
plot(mydata) 


# fit models 
mymod.1 <- Im(y ~“ X1 + X2 + X3, data=mydata) 


4 plot models, extract model summaries 
plot(mymod.1) 
summary(mymod.1) 


Graphics: Why plot your data? 


* Three data sets with exactly the same bivariate summary 
statistics: 
= Same correlations, linear regression lines, etc 
" |ndistinguishable from standard printed output 


Standard data r=0 but + 2 outliers Lurking variable? 


Roles of graphics in data analysis 


* Graphs (& tables) are forms of communication: 
=" What is the audience? 
= What is the message? 


Presentation graphs: design to attract 
attention, make a point, illustrate a 
conclusion 


Analysis graphs: design to see 
patterns, trends, aid the process of 
data description, interpretation 


Basic functions of data display 


Reconnaisance Perception 
Exploration Detection 
hiis dee Comparison 
Model building P 
Data Display 
to Simulate Aesthetics 
to Persuade Rhetoric 


to Inform 


Exposition 


The 80-20 rule: Data analysis 


e Often ~80% of data analysis time is spent on data preparation 
and data cleaning 
1. data entry, importing data set to R, assigning factor labels, 
2. data screening: checking for errors, outliers, ... 
3. Fitting models & diagnostics: whoops! Something wrong, go back to step 1 


* Whatever you can do to reduce this, gives more time for: 
=" Thoughtful analysis, 
" Comparing models, 
= Insightful graphics, 
" Telling the story of your results and conclusions 


This view of data analysis, 
statistics and data vis is now P adii 
rebranded as "data science" Import —* Tidy ——* Transform — —» Communicate 


Visualise 


DN Model 


Understand 


Program 
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The 80-20 rule: Graphics 


* Analysis graphs: Happily, 2096 of effort can give 8096 of a 
desired result 
=" Default settings for plots often give something reasonable 
= 90-10 rule: Plot annotations (regression lines, smoothed curves, data 
ellipses, ...) add additional information to help understand patterns, 
trends and unusual features, with only 10% more effort 
* Presentation graphs: Sadly, 80% of total effort may be 
required to give the remaining 20% of your final graph 
=" Graph title, axis and value labels: should be directly readable 


=" Grouping attributes: visually distinct, allowing for BW vs color 
* color, shape, size of point symbols; 
* color, line style, line width of lines 


= Legends: Connect the data in the graph to interpretation 
= Aspect ratio: need to consider the H x V size and shape 
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What can | do with R graphics? 


A wide variety of standard plots (customized) 


Responses per Travel 


tooth length 


D 


h 


line graph: plot() 


cond 


e 


c 


D 


Responses per 


Density 


Histogram of Y 200 
hist() 
150 
100 
50 
T T T T T T 13 
3 2 4 0 4 2 3 
0 


barchart() 


Bivariate plots 


R base graphics provide a wide variety of different plot types for bivariate data 


The function plot(x, y) isgeneric. It produces different kinds of plots depending 
on whether x and y are numeric or factors. 


plot (num,num) smoothScatter () sunflowerplot () 


Some plotting 


* functions take a 
X X matrix argument & 
o T A plot all columns 
f 
boxplot (list) : : - 
plot (fac, num) barplot (matrix) barplot (matrix) dotchart (matrix) 
— 
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Bivariate plots 


A number of specialized plot types are also available in base R graphics 


Plot methods for factors and tables are designed to show the association between 
categorical variables 


The vcd & vcdExtra 


stripchart (list) ; 
plot (num, fac) ere AUR LEN et packages provide more 
and better plots for 
categorical data 
spineplot (fac, fac) mosaicplot () 


plot (fac, fac) assocplot () fourfoldplot () plot (table) 


i=. @ 


- ‘OE 
^" wo ^ " 


un 


Sndanized 
Residuals 
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Mosaic plots 


Similar to a grouped bar chart Brown Blue Hazel Green 


Shows a frequency table with tiles, 
area ^' frequency 


Black 


» data(HairEyeColor) 
> HEC «- margin.table(HairEyeColor, 1:2) 
» HEC " 
Eye Š 119 

Hair Brown Blue Hazel Green a. 

Black 68 20 15 5 I 

Brown 119 84 54 29 

Red 26 17 14 14 

Blond 7 94 10 16 " 


» chisq.test(HEC) 
Pearson's Chi-squared test 


data: HEC 
X-squared - 140, df - 9, p-value «2e-16 


Re 
Ü 
^d 


Blond 
bs | 


How to understand the association 
between hair color and eye color? 


2 


[92] 


Mosaic 2 


Brown Blue HazeGreen 
Shade each tile in relation to the 


Pearson 
residuals: 
contribution to the Pearson y? 


statistic 40 
2 
O. —e@. 2 2.0 
pg-yg2yeree "uo 
e; 0.0 


-2.0 
» round(residuals(chisq.test(HEC)), 2) 


: i 
Eye 
Hair Brown Blue Hazel Green ae MES 
Black 4.40 -3.07 -0.48 -1.95 «2e-16 


Brown 1.23 -1.95 1.35 -0.35 
Red -0.07 -1.73 0.85 2.28 
Blond -5.85 7.05 -2.23 0.61 


Black 


Brown 


Hair 


Red 


Blond 


Discrete Data 
Analysis with R 


Mosaic plots extend readily to 3-way + tables 


They are intimately connected with loglinear models 
See: Friendly & Meyer (2016), Discrete Data Analysis with R, http://ddar.datavis.ca/ 
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Follow along 


From the course web page, click on the script 
duncan-plots.R, 
http://www.datavis.ca/courses/RGraphics/R/duncan-plots.R 


Select all (ctrl--A) and copy (ctrl--C) to the clipboard 
In R Studio, open a new R script file (ctrl--shift--N) 
Paste the contents (ctrl+V) 

Run the lines (ctrl--Enter) along with me 


Multivariate plots 


The simplest case of multivariate plots 
is a scatterplot matrix — all pairs of 
bivariate plots prestige 


In R, the generic functions plot ( ) 
and pairs() have specific methods 
for data frames 


data(Duncan, package-"car") 

plot(- prestige + income + education, 
data-Duncan) 

pairs(- prestige + income + education, 
data-Duncan) 


80 100 


60 


education 


20 


20 40 60 80 100 
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Multivariate plots 


These basic plots can be enhanced in 
many ways to be more informative. 


The function scatterplotMatrix() in the 

car package provides 

* univariate plots for each variable 

e linear regression lines and loess 
smoothed curves for each pair 

* automatic labeling of noteworthy 
observations (id.nz) 


library(car) 


oRR.epgineer 
7 
o 
p 
Pa o 
z 


oRR.engineer 
conductor NT 


£ 


c» 


# 


20 40 60 80 100 20 40 60 80 100 


scatterplotMatrix(-prestige + income + education,  data-Duncan, id.n-2) 
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Multivariate plots: corrgrams 


For larger data sets, visual corrgram(cars[,3:9], upper.panel-panel.cor, diag.panel-panel.density) 


summaries are often more useful 
than direct plots of the raw data 


A corrgram ("correlation diagram") B. 0.95 0.84 0.90 -0.52 
allows the data to be rendered in a 


variety of ways, specified by panel gne 0.90 0.93 -0.56 


functions. 
d 0.87 


Here the main goal is to see how -0.70 


mpg is related to the other 
variables 


-0.43 


[PR or -0.80 -0.78 -0.83 0.42 0.58 


-0.36 


-0.38 


-0.42 


-0.32 


fe os 


ZANSSSBISS 
| f 


See: Friendly, M. Corrgrams: Exploratory displays for correlation matrices. The American Statistician, 2002, 56, 316-324 
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Multivariate plots: corrgrams 


For even larger data sets, more xe 

abstract visual summaries are rood ag Reh Seco e! ox oe AKIN " 
necessary to see the patterns of Ash // 

relationships. color C Qa "4 i 


MalicAcid Fa LX Y ET ! 
This example uses schematic ites VA x WWSVe T 
AlcAsh P VA. | eee € 


ellipses to show the strength and 


And | we W€wWco/esco B 
direction of correlations among 
variables on a large collection of iin SNe . 
Italian wines. Proa Q 09^ | 
Phenols LL ^ 4^4 APA LÀ "i 
Here the main goal is to see how Proline LY Y rr YY Wa VN ax 
the variables are related to each Mg 7. -0.8 
other Alcohol ^ 1 @ e Ti WA 


library(corrplot) 
corrplot(cor(wine), tl.srt=30, method="ellipse", orderz"AOE") 


See: Friendly, M. Corrgrams: Exploratory displays for correlation matrices. The American Statistician, 2002, 56, 316-324 
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Generalized pairs plots 


Generalized pairs plots from the gpairs 
package handle both categorical (C) and 
quantitative (Q) variables in sensible ways 


x y pot a 


scatterplot 


Q 

Q boxplot 
C barcode 
C 


Q 
C 
Q 
C 


mosaic 


8888 
L 


Placebo Treated 


H 


Female 


library(gpairs) 
data(Arthritis) 
gpairs(Arthritis[; ¢(5,.2:5) |, 7.) 
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Models: diagnostic plots 


Linear statistical models (ANOVA, 
regression), y = X B + £, require some 
assumptions: € ~ N(O, o?) 


For a fitted model object, the plot() 
method gives some useful diagnostic 


plots: 

e residuals vs. fitted: any pattern? 

e Normal QQ: are residuals normal? 
e  scale-location: constant variance? 
e  residual-leverage: outliers? 


Residuals 


AlStandardized residualsl 


Residuals vs Fitted 


Fitted values 


Scale-Location 


@minister 


Fitted values 


Standardized residuals 


Standardized residuals 


Normal Q-Q 


-2 4 0 1 2 
Theoretical Quantiles 


Residuals vs Leverage 
M T = 


Se scm 
mite ite | 
EX e* weed 
. 
4 e 
ore i . 
% 
2l e e. 
% «v. 
= s e 
* 
|| P conductor alse 
é NL 
- CooRF distance — 77 . 


0.00 0.05 0.10 0.15 020 025 


Leverage 


duncan.mod <- lm(prestige ~ income + education, data-Duncan) 


plot(duncan.mod) 
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Models: Added variable plots 


The car package has many more functions for plotting linear model objects 
Among these, added variable plots show the partial relations of y to each x, holding all 
other predictors constant. 


library(car) 
avPlots(duncan.mod, id.n=2,ellipse=TRUE, ..) 


Added-Variable Plots 

Each plot shows: 
partial slope, B; 
influential obs. 


RR.engineere 


o o 
z i 
o © .|eminister o 
a gonductore zy 
I] E] 
5» > = = 
2 2 
©. n. 
= o 
i My 
e 
ig 
e ew 
is ereporter y 
e? 
-40 -20 0 20 40 -60 -40 -20 0 20 40 
income | others education | others 
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Models: Interpretation 


Fitted models are often difficult to interpret from tables of coefficients 


4 add term for type of job 
duncan.modi <- update(duncan.mod, . ~ . + type) 
summary (duncan.mod1) 


Call: 
lm(formula = prestige - income + education + type, data = Duncan) 


Coefficients: 
Estimate Std. Error t value Pr(>|t]) 


(Intercept)  -0.18503 3.71377 -0.050 0.96051 How to understand 
income 0.59755 0.08936 6.687 5.12e-08 *** 

education 0.34532 0.11361 3.040 0.00416 ** effect of each 
typeprof 16.65751 6.99301 2.382 0.02206 * predictor? 

typewc -14.66113 6.10877 -2.400 0.02114 * 


Signif. codes: © ***** 0.001 '**' 0.01 **" 0.05 ^.” 0.1 * * 1 
Residual standard error: 9.744 on 40 degrees of freedom 
Multiple R-squared: 0.9131, Adjusted R-squared: 0.9044 
F-statistic: 105 on 4 and 40 DF, p-value: « 2.2e-16 
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Models: Effect plots 


Fitted models are more easily interpreted by plotting the predicted values. 
Effect plots do this nicely, making plots for each high-order term, controlling for others 


library(effects) 
duncan.effi <- allEffects(duncan.mod1) 
plot (duncan. eff1) 


income effect plot education effect plot type effect plot 


70 70 60 


60 60 
50 


50 
50 


prestige 
prestige 
prestige 


40 
40 
40 
30 
30 


30 20 


10 20 30 40 50 60 70 80 20 30 40 50 60 70 80 90 100 be prof we 
income education type 37 


Models: Coefficient plots 


Sometimes you need to report or display the coefficients from a fitted model. 
A plot of coefficients with Cls is sometimes more effective than a table. 


library(coefplot) 


duncan.mod2 <- lm(prestige ~ income * education, data-Duncan) 
coefplot(duncan.mod2, intercept-FALSE, lwdInner-2, lwdOuter-1, 
title="Coefficient plot for duncan.mod2") 


Coefficient plot for duncan.mod2 


income:education LÀ 


education 


Coefficient 


income 
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Coefficient plots become 

increasingly useful as: 

(a) models become more complex 

(b) we have several models to 
compare 


This plot compares three different 
models for women's labor force 
participation fit to data from Mroz 
(1987) in the car package 


This makes it relatively easy to see 
(a) which terms are important 
(b) how models differ 


This example from: https://www.r-statistics.com/2010/07/visualization-of-regression-coefficients-in-r/ 


inc 


Iwg 


hcyes 


wcyes 


age 


k618 


k5 


(Intercept) 


Regression Estimates 
-2 0 2 4 6 


family income - wife's income 


i 
ptum log wage rate for working women 
i —— 
| 
— husband's college attendance 
——— 


wife's college attendance 


E. number of children 6-18 
—- 


number of children 5 years + 


Y) 
= 
G 

e 

© 

v 
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O 
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R has a wide variety of features and 
packages that support 3D graphics 


This example illustrates the concept 
of an interaction between predictors 


in a linear regression model 


It uses: 


.) 


lattice::wireframe(z ~ x + y, .. 


TLLA 


printed” 36 times 
about the z axis to 


ut 


The basic plot is 
rotated 10° 


LI 
- 
W 


produce 36 PNG images. 


The ImageMagick utility is used to 


convert these to an animated GIF 


graphic 


*y 


10 + .5x+.3y+.2x 


Z= 
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3D graphics: code 


1. Generate data for the model z = 10 + .5x +.3y + .2 x*y 


bO «- 10 # intercept 
b1 <- .5 # x coefficient 
b2 <- .3 # y coefficient 


int12 <- .2 # x*y coefficient 

g <- expand.grid(x = 1:20, y = 1:20) 

g$z <- bO + b1*g$x + b2*g$y + inti2*g$x*g$y 
2. Make one 3D plot 


library(lattice) 
wireframe(z ~ x * y, data = g) 


3. Create a set of PNG images, rotating around the z axis 


png(file="example%03d.png", width=480, height=480) 
for (i in seq(0, 350 ,10)){ 
print(wireframe(z ~ x * y, data = g, 
screen = list(z = i, x = -60), drape=TRUE) )} 
dev.off() 


4. Convert PNGs to GIF using ImageMagik 
system("convert -delay 40 example*.png animated 3D plot.gif") 
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3D graphics 


The rgl package is the most general for 
drawing 3D graphs in R. 

Other R packages use this for 3D statistical 
graphs 


This example uses car::scatter3d() to 
show the data and fitted response surface 
for the multiple regression model for the 


Duncan data Vac 


scatter3d(prestige ~ income + education, 
data-Duncan, id.n-2, revolutions-2) 
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Statistical animations 


Statistical concepts can often be 
illustrated in a dynamic plot of some 
process. 


This example illustrates the idea of 
least squares fitting of a regression 
line. 


As the slope of the line is varied, the 
right panel shows the residual sum 
of squares. 


This plot was done using the animate 
package 


10 12 14 


esidual Sum of Squares 
500 


R 
500 


800 


700 


400 


10 


20 30 
Slope = 0.325 


40 
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Data animations 


Year: 1952 
. : Africa Americas Asia 
Time-series data are often plotted uH 
against time on an X axis. 
.* 
LJ 
ee LÀ 
. , 60- M P 
Complex relations over time can i ee 
RR "c e? “ae 
often be made simpler by animating e > e e 
. s . 407 a 
change - liberating the X axis to 4 - . , 
. e E 
show something else B f 
& ss Sch 1e*03 1e*04 1e*05 
This example from the tweenr £ so- 
package (using gganimate) A e 
60- T 
o 
40- 
1e+03 1e*04 1e405 1e*03 1e404 1e405 
GDP per capita 


See: https://github.com/thomasp85/tweenr for some simple examples 
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Maps and spatial visualizations 


Spatial visualization in R, combines map data sets, statistical models for spatial data, 
and a growing number of R packages for map-based display 


This example, from Paul Murrell's R 
Graphics book shows a basic map of 
Brazil, with provinces and their capitals, 
shaded by region of the country. 


Data-based maps can show spatial 
variation of some variable of interest 


Bele 


m 
Sao Luis’ 


z 
eresind? Na 
7 .— Joao Pesso: 
í A ara 


Palma Wass 


D 
Salvado 


lL Cuiab® Brasili® 
b Goiani® 

canon can belo Horizonte E 
Nd 


Murrell, Fig. 14.5 
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Maps and spatial visualizations 


Dr. John Snow's map of cholera in 
London, 1854 


Enhanced in R in the HistData 
package to make Snow's point 


Portion of Snow's map: 


Snow's Cholera Map, Death Intensity 


á 
Warwick 


| 
| 
L 


library(HistData) 
SnowMap(densityZ TRUE, 
main=“Snow's Cholera Map, Death Intensity") 


Contours of death densities are calculated using 
a 2d binned kernel density estimate, bkde2D() 
from the KernSmooth package 46 


Maps and spatial visualizations 


Dr. John Snow's map of cholera in 
London, 1854 


Enhanced in R in the HistData 
package to make Snow's point 


These and other historical 
examples come from Friendly & 
Wainer, The Origin of Graphical 
Species, Harvard Univ. Press, in 
progress. 


Snow's Cholera Map with Pump Neighborhoods 


SnowMap(densityZ TRUE, 
main-"Snow's Cholera Map with Pump Neighborhoods") 


Neighborhoods are the Voronoi polygons of the 
map closest to each pump, calculated using the 
deldir package. 
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Diagrams: Trees & Graphs 


A number of R packages are specialized to draw particular types of diagrams. "e 
igraph is designed for network diagrams of nodes and edges le? 
library(igraph) 
tree «- graph.tree(10) full <- graph.full(10) 
tree <- set.edge.attribute(tree, "color", value="black") fulllgraph «- set.edge.attribute(full, "color", 
plot(treelgraph, | | value="black") 
layout-layout.reingold.tilford(tree, plot(full, layout=layout.circle) 


root=1, flip.y=FALSE)) 


Ww 


Se 


j 


NE 


SA 
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Diagrams: Network diagrams 


graphvis (http://www.graphviz.org/) is a comprehensive program for drawing 
network diagrams and abstract graphs. It uses a simple notation to describe nodes 


and edges. 
The Rgraphviz package (from Bioconductor) provides an R interface 


This example, from Murrell's R Graphics 
book, shows a node for each package that 
directly depends on the main R graphics 
packages. 


An interactive version could provide "tool 
tips", allowing exploring the relationships 
among packages 


Otz 


—— 
Murrell, Fig. 15.5 HU 
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Diagrams: Flow charts 


The diagram package: 


Functions for drawing diagrams with 
various shapes, lines/arrows, text 
boxes, etc. 


do you 
understand flow 
charts? 


you see 
the lines labeled 


yes? 


you see 


the lines labeled 


"no"? 


you see 
the lines labeled 
"no? 


Flow chart about understanding flow charts (after 


http://xkcd.com/518 ). From: Murrell, Fig 15.10 
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Path diagrams: structural equation models 


Similar diagrams are used to display structural equation models as "path diagrams" 
The sem and laavan packages have pathDiagram() functions to draw a proposed or 
fitted model. 

They use the DiagrammeR package to do the drawing. 


library(sem) 
union.mod <- specifyEquations(covs-"x1, x2", text=" 
y1 = gam12*x2 
y2 = beta21*y1 + gam22*x2 
y3 = beta31*y1 + beta32*y2 + gam31*x1 
") 
union.sem <- sem(union.mod, union, N=173) 
pathDiagram(union.sem, 
edge.labels="values", 
file="union-sem1", 
min.rank=c("x1", "x2")) 


Sentiment 


0.86 
-0.22 
Deference 


E -028 0.85 


Activism 
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Dynamically updated data visualizations 


The wind map app, http://hint.fm/wind/ is one of a growing number of R-based 
applications that harvests data from standard sources, and presents a visualization 


Sept. 24, 2018 


iH 


M 
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Web scraping: CRAN package history 


R has extensive facilities for extracting and processing information obtained from web 


pages. The XML package is one useful tool for this purpose. 


This example: 
e downloads information about all R 


packages from the CRAN web site, 890 = o 
e finds & counts all of those available for : 
each R version, is tei] 


e plots the counts with ggplot2, adding a 
smoothed curve, and plot annotations 4o 


2000 


On Jan. 27, 2017, the number of R 2006 2008 2010 Hs 
packages on CRAN reached 10,000 


Code from: https://git.io/vyAwS 


Number of CRAN packages per R version 


o 
m 
c 
2 
$ 
€ 
o 
> 
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shiny: Interactive R applications 


shiny, from R Studio, makes it easier to develop interactive applications 


Shiny User Showcase 
The Shiny User Showcase contains an inspiring set of sophisticated apps developed and contributed by Shiny users. 


= m 
P E E Wet cev Pas ] hix ies 
oe ` = == T — 
^ | 
1 \ 
4 | = ~= . 
* 4 7 COA LM j 
Nias 
=! 
Genome browser Papr Lego Set Database Explorer See more 


Interactive visualizations 
Shiny is designed for fully interactive visualization, using JavaScript libraries like d3, Leaflet, and Google Charts. 


Movie explorer 


SuperZip example Bus dashboard Movie explorer Google Charts 


Many examples at https://shiny.rstudio.com/gallery/ 
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Reproducible analysis & reporting 


JOE. E , eee RSI. View POF 
710 ?- @ mo 


title: "Welcome to R Markdown” 
author: "RStudio" 

date: "December 18, 2014" 
output: pdf document 


Sn OS VS oS IND 


This is on R Morkdown document. 
simple formatting syntox for out 
U MS Word documents. For 


click the **Knit** butt 
generated thot includes 
as the output of ony est 
chunks within the document. 


ir} 
13 coef(lm(dist - speed, data = ca 


TÉ Ya cn olen amhend ninie fna 
13 Top Leve? 


Comoe R Markdown 


R Studio, together with the knitr 
and rmarkdown packages provide 
an easy way to combine writing, 
analysis, and R output into 
complete documents 


.Rmd files are just text files, using 
rmarkdown markup and knitr to 
run R on “code chunks” 


A given document can be 
rendered in different output 
formats: 


|| Q | a knit ~ 
9 Knit to HTML | 
7*1 Knit to PDF 

W] Knit to Word 

Knit with Parameters... 
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Output formats and templates 


| The integration of R, R Studio, knitr, 
t rmarkdown and other tools is now 


highly advanced. 


My last book was written 
entirely in R Studio, using .Rnw 
syntax — LaTeX — PDF — 
camera ready copy 


The ggplot2 book was written 
using .Rmd format. 


The bookdown package makes 
it easier to manage a book- 
length project — TOC, fig/table 
Hs, cross-references, etc. 


Templates are available for APA papers, 
slides, handouts, entire web sites, etc. 
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Writing it up 


* |n R Studio, create a .Rmd file to use R Markdown for 
your write-up 
= lots of options: HTML, Word, PDF (needs LaTeX) 


File | Edit Code View Plots Session Build Debug Profile Tools Help New R Markdown 
New File > i j 
, R Script Ctrl+Shift+N Decent Title: | B 
New Project... 5 : 
R Notebook DA Presentation Author: |Michael Friendly 
Open File... Ctrl-O OUT $) Shiny DS " 
Reopen with Encoding... N 2 3 
a Shiny Web App... From Template @ HTML 
Recent Files L Recommended format for authoring (you can switch to PDF 
Text File or Word output anytime). 
Open Project... 
- : E C++ File © PDF 
Open Project in New Session... PDF output requires TeX (MiKTeX on Windows, MacTeX 
Recent Projects > R Sweave 2013+ on OS X, TeX Live 2013+ on Linux). 
RHTML Ow 
Import Dataset » z g : : i 
R Presentation Previewing Word documents requires an installation of MS 
= Word (or Libre/Open Office on Linux). 
Save Ctrl+S R Documentation 
Save As... 
OK Cancel 


58 


Writing it up 


* Use simple Markdown to write text 


* Include code chunks for analysis & graphs 


mypaper.Rmd, created from a template 


© | analysis.R O | read-mydata.R Q | mypaper.Rmd 
ÁN I S Knit ~ Y 

BE --- 

2 title: "mypaper" 

3 author: "Michael Friendly" 

4 date: "January 29, 2018" yaml header 
5 output: html document 

6 — 

7 

8- ```{r setup, include=FALSE} 

9 knitr::opts_chunk$set(echo = TRUE) 

309 7 

11 

12- ## R Markdown Header 2 
13 

14 This is an R Markdown document. Markdown is a simple formatting 

details on using R Markdown see «http: //rmarkdown. rstudio. com>. 
15 
16 when you click the **Knit** button a document will be generated 
R code chunks within the document. vou can embed an R code chunk 

17 
18- ``{r cars} 
= summary (cars) output code chunk 
21 

22+ ## Including Plots 

23 
24 You can also embed plots, for example: 
25 
26~ QU pressure, echo=FALSE} 
27 plot(pressure) plot code chunk 
28 ^77 


Help -» Markdown quick reference 


Files Plots Packages Help Viewer aC 
^s 


^ 
Markdown Quick Reference * 


Markdown Quick Reference 


R Markdown is an easy-to-write plain text format for creating dynamic 
documents and reports. See Using R Markdown to learn more. 


m 


Emphasis 
*italic* **bold** 
_italic_ __bold_ 


Headers 
# Header 1 
## Header 2 


### Header 3 


Lists 
Unordered List 


* Item 1 

* Item 2 
* Item 2a 
* Item 2b 


Ordered List 


1. Item 1 

2. Item 2 

3. Item 3 
* Item 3a 
* Item 3b 
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rmarkdown basics 


rmarkdown uses simple markdown formatting for all standard document elements 


^[Desktop/example.html 
Q1 example.Rmd * -[e99*9 l p/examp 


example.html | gi | Open in Browser Find 


Header 1 


This is an R Markdown document. Markdown is a 
simple formatting syntax for authoring web pages. 


T) Ee Q | am Knit HTML ~ S ~ @ 
~ # Header 1 


i 
2 
3 This is an R Markdown document. Markdown is a 
simple formatting syntax for authoring webpages. 


5 Use an asterisk mark to provide emphasis, such 


*4 1 * *ck ** 
dS eae op T BORNE Use an asterisk mark to provide emphasis, such as 


6 TG 

7 Create lists with a dash: italics or bold. 

: — Create lists with a dash: 
- Item 

10 - Item 2 e |tem 1 

11 = Item 3 e. Item 2 

"s e |tem 3 


14 Use back ticks to 


15 create a block of code Use back ticks to 


create a block of code 


18 Embed LaTex or MathML equations, Embed LaTex or MathML equations, + 57 , x; 
19 $\frac{1}{n} \sum_{i=1}4{n} x_{i}$ 
Or even footnotes, citations, and a bibliography. ' 
21 Or even footnotes, citations, and a 


bibliography. [A1] 


22 

i e 
23 [^1]: Markdown is great. 1. Markdown is great. 
11 © Header 1 + R Markdown 2 
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R code chunks 


R code chunks are run by knitr, and the results are inserted in the output document 


©) chunks.Rmd x mE aTi) RStudio: Preview HTML 
Siere ; = — -— There are many 
DUE [moj << Knit HTML ->| o> Chunks » || Preview: ~/chunks.htm! | Æ) | fH Save As 4 Publish . : 
1 R Code Chunks options for controlling 
o ———————— : 
3 R Code Chunks the details of chunk 
4 With R Markdown, you can insert R code With R Markdown, you can insert R code chunks including plots: EN 
|! chunks including plots: output numbers, 
5 # quick summary and plot 
6- """(r aplot, fig.width-4, fig.height-3, library(ggplot2) tables, graphs 
| message=FALSE} summary( cars) 
7 # quick summary and plot 
8 library(ggplot2) : 
[2 speed dist 
9 summary(cars) ; . ; š 
10 qplot(speed, dist, data=cars) + n ^ot eie "x : = Choose the output 
om. geon-smocthO ## Median :15.0 Median : 36 format: 
12 ## Mean :15.4 Mean :43 
13 ## 3rd Qu.:19.0 3rd Qu.: 56 
## Max. :25.0 Max.  :120 NIBI F @ | wv Ki - 


®) Knit to HTML | 
=) Knit to PDF 
W] Knit to Word 


qplot(speed, dist, data = cars) + geom smooth() 


An R chunk: 
"(ir name, options} 


100 - Knit with Parameters... | 


dist 


# R code here 


ANN 
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The R Markdown Cheat Sheet provides most of the details 
https://www.rstudio.com/wp-content/uploads/2016/03/rmarkdown-cheatsheet-2.0.pdf 


oo Rr thes Reproducible Research $ B8 Oynar Docements 
R Markdown Cheat Sheet LOL MB fbs a a weard ol At the Chek of a barion œ thw type i = è eran choone 15 export ite 
ho 4 prea tortem fe code hat jii ofa command you can ern the g- treated report ai a hem p 
learn more at rmarkdown.rstedio.com Om Hem Va mpm foe yeu mei mc | eode e an K Martciown the 15 E a M3 Word, COT, TI, or 
g rg WA T narrateon Tit à read wpetduee por mor red expert ~ arbor do vorm od ansa 
(sudo P He vum aa a tnte report QQ Mehr eat ene te tem 


new Aired fle ios o e 2 eem g theme ts cse pen. qmm 
ha the entered hat one No (ov iind Aa Pa Rr orto a Pyme Pompa sr bet tumor oc vata () to kat 


s Mostra 


MU 
tr enun at 


Reheat nd 
Fed n dox amant 
~~ Akpa 
cae hel 


mon oF themes arat Vater m a meee 
fuere peus tr arare HTML CHOR anat MS Yen 


inline code 
Weert wi vecnáe Renin icon a tet without iod ptr per inr "qm 
Utt Pam 


[ELA NE 


—ÓÓ —Á— Spa Mo WR nter fete — Amm 


rm c mel 7. m rene meacres? 
OMM, ei inia ir ee rua engine, oe lingapin narh N E NO put i cd betw oh God 


Set 
Sas Race a perra 
bj pubis P Oe onl: val Pus cade cu fata = — bin nad RENE Se pass piunt 


ATTER ENEE UO MT TS I (71 v d- Tree, =| e 


ENOO & a Tae USES E e rete: POBI * Mee dio" mc LS mme oa ILLI LLL AMNES D S S ÉL LO 


R notebooks 


Often, you just want to "compile" an R script, and get the output embedded in the 
result, in HTML, Word, or PDF. Just type Ctrl-Shift-K or tap the Compile Report button 


5 


E = Run | 9% || 9 
Compile Report 
(Ctrl+Shift+K) 
© Rstudio 
File Edit Code View Plots Session Build Delfug Profile Tools Help 
Qi. e [55] +! Addins ~ 
@)3d-demo.R* — Qj3d-demo.Rmd x — 6 jinv-exl.R ec 
ri Source on Save q Á- E -9*Run [b+ | $9Soure x = 
ENS" —- a 
2 #' title: "Inverse of a matrix" 
3 #' author: "Michael Friendly" 
4 #° date: "07 Sep 2016" 
5 #° output: html document E 
6 $' --- 
7 
8 #' The following examples illustrate the basic properties of the inver 
9 s 
10 #' ### Load the ^matlib' package| 
11 #' 
12 #' This defines: `inv()`, `Inverse()`; the standard function for matr 
13 
14 library(matlib) 
15 
16 #' ### Create a 3 x 3 matrix 
17 A <- matrix( c(5, 1, O, 
18 3;-1; 2; 
19 4, 0,-1), nrow=3, byrow=TRUE) 
20 det(A) 
21 
22 #' ### 1. det(A) != 0, so inverse exists 
23 (AI <- inv(A)) 
24 
25 #' ### 2. Definition of the inverse: $AM{-1} A=AA = I$ or `AI 
26 #' - 
27 E m D 
10:33 _ (Top Level) + R Script = 
Console R Markdown * em C1 
Type demo) tor some demos, “help()~ tor on-line help, or x 


'help.start()' for an HTML browser interface to help. 
Type 'qO' to quit R. 


[workspace loaded from ~/.RData] 


- 


1 [m] 


————— 


© Ci/Dropbox/Documents/6140/R/inv-ex..html 


I Open in Browser 


nv-ex1.html “S- Publish 


Inverse of a matrix 


Michael Friendly 
07 Sep 2016 


The following examples illustrate the basic properties of the inverse of a matrix 


m 


Load the matlib package 


This defines: inv() , Inverse() ; the standard function for matrix inverse is solve () 


library (matlib) 


Create a 3 x 3 matrix 


A «- matrix( c(5, 1, 0 
3,-1, 
4, 0,-1), nrow-3, byrow-TRUE) 
det (A) 
## [1] 16 


1. det(A) != 0, so inverse exists 


(AI <- inv(A) 


Lj 
Lj 
Lid 
H 


Li] [,2] 
[1,] 0.0625 0.0625 0.125 
[2,] 0.6875 -0.3125 -0.625 
[3,] 0.2500 0.2500 -0.500 


(,3] 


2. Definition of the inverse: A ! A = AA !=Tor 
AI * A = diag(nrow(A) ) 


NB: Sometimes you will get very tiny off-diagonal values (like 1.341e-13 ). The function 
zapsmall() will round those to 0 


AI $** A 


Summary & Homework 


* Today has been mostly about an overview of R 
graphics, but with emphasis on: 
= R, R Studio, R package tools 
= Roles of graphics in data analysis, 


= A small gallery of examples of different kinds of graphic applications in 
R; only small samples of R code 


=" Work flow: How to use R productively in analysis & reporting 
* Next week: start on skills with traditional graphics 


e Homework: 
" Install R & R Studio 


= Find one or more examples of data graphs from your research area 
e What are the graphic elements: points, lines, areas, regions, text, labels, ??? 
* How could they be "described" to software such as R? 
* How could they be improved? 
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